

Type 


L # 


Hits 


Search Text 


DBs 


1 


BRS 


LI 


39 


(search$3 quer$3 retriev$3) 
same cach$3 same (reus$3 
AND count$3 AND quer$3) 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEI 

T; 

IBM TI 
B 


si 


2 


BRS 


L2 


3910 


((search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) SAME optimiz$3) 
AND (cache$l OR (temporar$4 
SAME storage$l) ) AND 
( (access$3 OR visit$3 OR 
retriev$3) SAME (count$3 OR 
number$l) ) 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEI 

T; 

IBM TI 
B 


si 


3 


BRS 


L3 


1282 


((search$3 OR query OR 
queries OR brows$3 OR 
retriev$3j same optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l)) same 
( (access$3 OR visit$3 OR 
retriev$3) same (count$3 OR 
number$l ) ) 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEt 

T; 

IBM TI 
B 


si 
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Type 


L # 


Hits 


Search Text 


DBs 


4 


BRS 


L4 


108 


( (search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) SAME optimiz$3) 
same (cache$l OR 
(temporar$3 SAME 
storage$l) ) same ( (access$3 
OR visit$3 OR retriev$3) 
same (count$3 OR number$l)) 
AND (threshold$l OR 
condition$l ) 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 


5 


BRS 


L5 


108 


3 and 4 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEl 

T; 

IBM_TI 
B 


D 


6 


BRS 


L6 


108 


2 and 5 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEl 

T; 

IBM TI 
B 


D 
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Type 


L # 


Hits 


Search Text 


DBs 


7 


BRS 


L7 


14 


2 and 1 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 


8 


BRS 


LIO 


137 


"707/10. eels, and 
((search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) SAME optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l)) AND 
( (access$3 OR visit$3 OR 
retriev$3) SAME (count$3 OR 
number$l)) AND (threshold$l 
OR condition$l) AND 
(reset$3 OR reinitialz$3 OR 
re-initializ$3 OR initial 
OR defaut$l) 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

T RM T H 
1 DlYl 1 U 

B 


9 


BRS 


Lll 


78 


"707"/104.1.ccls. and 
( (search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) SAME optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l) ) AND 
( (access$3 OR visit$3 OR 
retriev$3) SAME (count$3 OR 
number$l)) AND (threshold$l 
OR condition$l) AND 
(reset$3 OR reinitialz$3 OR 
re-initializ$3 OR initial 
OR defaut$l) 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

TRM TH 
X SjVI 1 U 

B 
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Type 


L # 


Hits 


Search Text 


DBs 


10 


BRS 


L12 


10567 


(search$3 same cach$3) 


us- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEf 

T; 

IBM TI 
B 


i 
) 


11 


BRS 


L13 


13 


(search$3 same cach$3) same 
count$3 same threshold$l 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEt 

T; 

IBM TI 
B 


3 


12 


BRS 


L14 


2964 


709/201-203. eels, and 
(search$3 quer$3 retriev$3) 
and cach$3 


US- 

PGPUB 

USPAT 

USOCR 

EPO; 

JPO; 

DERWEI 

T; 

IBM TI 
B 


3 
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Type 


L # 


Hits 


Search Text 


DBs 












US- 












PGPUB; 












US PAT; 










709/201-203. eels, and 


USOCR; 


13 


BRS 


L15 


7 


(search$3 quer$3 retriev$3) 
and cach$3 and (reus$3 same 
count$l) 


EPO; 
JPO; 
DERWEN 
T; 

IBM TD, 
B 












US- 












PGPUB; 












US PAT; 










709/217-219. eels, and 


USOCR; 


14 


BRS 


L16 


6 


(search$3 quer$3 retriev$3) 
and cach$3 and (reus$3 same 
count$l ) 


EPO; 
JPO; 
DERWEN 
T; 

IBM_TD 
B 






- 






us- 












PGPUB; 












US PAT; 










715/501. l.ccls. and 


USOCR; 


15 


BRS 


L17 


1 


(searcn$3 quer$3 retriev$3) 
and cach$3 and (reus$3 same 
count$l) 


EPO; 
JPO; 
DERWEN 
T; 

IBM TD 
B 
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Type 


L # 


Hits 


Search Text 


DBs 


16 


BRS 


L18 


i 


7 15/513 . eels . and (search$3 
quer$3 retriev$3) and 
cach$3 and (reus$3 same 
count$l) 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEf 

T; 

IBM TI 
B 




17 


BRS 


L20 


3 


(web same (search adj 
engine$l) ) and cach$3 and 
(reus$3 same count$l) 


us- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 


18 


BRS 


L22 


17 


quer$3 and (web and (search 
adj engine$l) ) and cach$3 
and (reus$3 same count$l) 


US- 

PGPUB, 

USPAT, 

USOCR, 

EPO; 

JPO; 

DERWEI 

T; 

IBM TI 
B 


5 
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Type 


L # 


Hits 


Search Text 


DBS 


19 


BRS 


L23 


1 


( (search$3 retriev$3 
quer$3) with (condition$l 
criteria) ) and (cach$3 with 
(quer$3 near result$l) ) and 
(reus$3 same count$l) 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 


20 


BRS 


L24 


174 


((search$3 retriev$3 
quer$3) with (condition$l 
criteria) ) and (cach$3 with 
(quer$3 near result$l)) 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 


2.1 


BRS 


L26 


182 


707/3, 10. eels, and 
((search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) same optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l)) same 
( (access$3 OR visit$3 OR 
retriev$3) same (count$3 OR 
number$l) ) 


us- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 
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Type 


L # 


Hits 


Search Text 


DBS 


22 


BRS 


L29 


4 


711/135 . eels . and 
( (search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) same optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l)) same 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; . 

DERWEN 

T; 

IBM TD 
B 










( (access$3 OR visit$3 OR 










retriev$3) same (count$3 OR 
number$l ) ) 


23 


BRS 


L30 


27 


711/113 . eels . and 
((search$3 OR query OR 
queries OR brows$3 OR 
retriev$3) same optimiz$3) 
AND (cache$l OR (temporar$3 
SAME storage$l)) same 
( (access$3 OR visit$3 OR 


US- 

PGPUB; 

USPAT; 

USOCR; 

EPO; 

JPO; 

DERWEN 

T; 

IBM TD 
B 










retriev$3) same (count$3 OR 
number$l) ) 
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Relevance scale 



1 A comprehensive proj ect for CS2: combinin g key data structures and al g orithms int< 
<^ inte g rated web browser and search en g ine 
Tia Newhall, Lisa Meeden 

February 2002 ACM SIGCSE Bulletin , Proceedings of the 33rd SIGCSE technical 

symposium on Computer science education SIGCSE '02, Volume 34 Is 
Publisher: ACM Press 
Full text available:^ pdf( 450.67 
KB) 



Additional Information: full citation , abstract , references . cjti.r 



We present our experience using a large, real-world application as a course project for tl 
second half of the semester of a CS2 course. Our primary goal for the project was to ere 
an engaging application that incorporated most of the key data structures and algorithm: 
introduced in the course. Specifically, the project uses binary search trees, priority queui 
hash tables, and graphs. The project consisted of four parts combined to build an integre 
web browser and search engine in Java. ... 

2 Reusable software components 
^ Trudy Levine 



September 2005 ACM SIGAda Ada Letters, Volume XXV Issue 3 
Publisher: ACM Press 

Full text available: 18 pdf(148.33 
KB) 



Additional Information: full citation , abstract , index terms 



This column consists of our yearly listing of sources for reusable software components. A 
always, no recommendation or guarantee by this column is implied. 

3 Ada software reuse: Reusable software components 
^ Trudy Levine 



August 2006 ACM SIGAda Ada Letters, Volume XXVI Issue 2 
Publisher: ACM Press 
Full text available: ffl pdf(101.52 
KB) 



Additional Information: full citation , abstract , index terms 



This column consists of our yearly listing of sources for reusable software components. A 
always, no recommendation or guarantee by this column is implied. 
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4 Reusable software components 
^ Trudy Levine 

V June 2005 ACM SIGAda Ada Letters, Volume XXV Issue 2 
Publisher: ACM Press 

Full text available: || pdf(109.1 1 Additjona , information: full citation , abstract , index terms 

This column consists of our yearly listing of sources for reusable software components. A 
always, no recommendation or guarantee by this column is implied. 

5 Reusable software components 
^ Trudy Levine 

V March 2005 ACM SIGAda Ada Letters, Volume XXV Issue 1 
Publisher: ACM Press 

Full text available: &pdf(m07 Addjtjona| | nformation: fu „ cit ation . abstract , index terms 

This column consists of our yearly listing of sources for reusable software components. /* 
always, no recommendation or guarantee by this column is implied. 

6 Customizing information capture and access 
^ Daniela Rus, Devika Subramanian 

^ January 1997 ACM Transactions on Information Systems (TOIS), Volume 15 issue l 
Publisher: ACM Press 

Full text available: ffl pdf(126MB) Additional lnformation: M^iQQ ; ^gt. references , citir 

index terms , review 

This article presents a customizable architecture for software agents that capture and ac 
information in large, heterogeneous, distributed electronic repositories. The key idea is t< 
exploit underlying structure at various levels of granularity to build high-level indices wit 
task-specific interpretations. Information agents construct such indices and are configure 
a network of reusable modules called structure detectors and segmenters. We illustrate < 
architectu ... 

Keywords: information gathering, software agents, table recognition 



The V-Way Cache: Demand Based Associativity via Global Replacement 
Moinuddin K. Qureshi, David Thompson, Yale N. Patt 

May 2005 ACM SIGARCH Computer Architecture News , Proceedings of the 32nd 
Annual International Symposium on Computer Architecture ISCA '05, v< 

33 Issue 2 

Publisher: IEEE Computer Society, ACM Press 

Full text available:^ pdf (231 .93 A ^*; , , r *• r „ - 4 u * * • i 

KB) Additional Information: full citation , abstract , index terms 

As processor speeds increase and memory latency becomes more critical, intelligent desi 
and management of secondary caches becomes increasingly important. The efficiency of 
current set-associative caches is reduced because programs exhibit a non-uniform distrit 
of. memory accesses across different cache sets. We propose a technique to vary the 
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associativity of a cache on a per-set basis in response to the demands of the program. B 
increasing the number of tag-store entries relative to the ... 

8 Web search 2: Detectin g dominant locations from search queries 
^ Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li 
^ August 2005 Proceedings of the 28th annual international ACM SIGIR conference < 
Research and development in information retrieval SIGIR '05 
Publisher: ACM Press 

Full text available: 18 pdf(290.97 Additional Information: full citation , abstract , references , citit 
KB) index terms 

Accurately and effectively detecting the locations where search queries are truly about h 
huge potential impact on increasing search relevance. In this paper, we define a search 
query's dominant location (QDL) and propose a solution to correctly detect it. QDL is 
geographical location(s) associated with a query in collective human knowledge, i.e., ont 
few prominent locations agreed by majority of people who know the answer to the query 
is a subjective and collective attribute of search ... 

Keywords: information retrieval, local search, query's dominant location, search, seard 
query location, search relevance 



Learnin g source-tar g et surface patterns for web-based terminolo g y translation 
Jian-Cheng Wu, Tracy Lin, Jason S. Chang 

June 2005 Proceedings of the ACL 2005 on Interactive poster and demonstration 

sessions ACL '05 
Publisher: Association for Computational Linguistics 

Full text available: HP pdf( 1 1 5.84 AJJ -*- , , « f * ■■ •* *• u , * * 

Additional Information: full citation , abstract , references 

This paper introduces a method for learning to find translation of a given source term on 
Web. In the approach, the source term is used as query and part of patterns to retrieve 
extract translations in Web pages. The method involves using a bilingual term list to lear 
source-target surface patterns. At runtime, the given term is submitted to a search engii 
then the candidate translations are extracted from the returned summaries and subsequ 
ranked based on the surface patterns, oc ... 



1 ^ Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 

Studies on Collaborative research CASCON '97 
Publisher: IBM Press 

Full text available- fi pdf(421MB) Additional lnform ation: f u" citation , abstract , references , inoj 

* — terms 

Understanding distributed applications is a tedious and difficult task. Visualizations basec 
process-time diagrams are often. used to obtain a better understanding of the execution 
application. The visualization tool we use is Poet, an event tracer developed at the Unive 
of Waterloo. However, these diagrams are often very complex and do not provide the us 
with the desired overview of the application. In. our experience, such tools display repeat 
occurrences of non-trivial commun ... 
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11 GPGPU: general purpose computation on graphics hardware 

^ David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff Woe 
^ Aaron Lefohn 

August 2004 ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04 
Publisher: ACM Press 

Full text available: H pdf (63.03 ...... , .. , „ .. 

I^/IB) Additional Information: full citation , abstract , citings 

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating p 
precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 

12 Search en g ineerin g 1: What's new on the web?: the evolution of the web from a ses 
eng ine perspective 

Alexandras Ntoulas, Junghoo Cho, Christopher Olston 

May 2004 Proceedings of the 13th international conference on World Wide Web W 
•04 

Publisher: ACM Press 

Full text available: H) pdf(502.41 Additional Information: full citation , abstract , references , citir 
KB) index terms 

We seek to gain improved insight into how Web search engines shouldcope with the evol 
Web, in an attempt to provide users with themost up-to-date results possible. For this 
purpose we collectedweekly snapshots of some 150 Web sites over the course of one ye< 
measured the evolution of content and link structure. Our measurements focus on aspec 
potential interest to search engine designers: the evolution of link structure over time, tl 
rate ofcreation of new pages and new distinct c ... 

Keywords: change prediction, degree of change, link structure evolution, rate of change 
search engines, web characterization, web evolution, web pages 



13 Technical pa pers: component technolo g ies: Component rank: relative significance r 
for software component search 

Katsuro Inoue, Reishi Yokomori, Hikaru Fujiwara, Tetsuo Yamamoto, Makoto Matsushita, SI 
Kusumoto 

May 2003 Proceedings of the 25th International Conference on Software Engineer 
ICSE '03 

Publisher: IEEE Computer Society 

Full text available: H pdf(884.54 AJJ . A . llt 

vo\ i§fl D ... . Additional Information: full citation , abstract , references , citir 
m_ W Publisher index terms ~~ 
Site 

Collections of already developed programs are important resources for efficient developn 
of reliable software systems. In this paper, we propose a novel method of ranking softwc 
components, called Component Rank, based on analyzing actual use relations among the 
components'and propagating the significance through the use relations. We have develoi 
component-rank computation system, and applied it to various Java programs. The resul 
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promising such that non-specific and generic ... 

14 Boostin g the performance of Web search engines: Caching and prefetching query 

<& re sults by exploitin g historical usa g e data 

Tiziano Fagni, Raffaele Perego, Fabrizio Silvestri, Salvatore Orlando 

January 2006 ACM Transactions on Information Systems (TOIS), Volume 24 Issue 1 

Publisher: ACM Press 

Full text available: *B Pdf(668.69 Additional Information: full citation , abstract , references , citit 
KB ) index terms 

This article discusses efficiency and effectiveness issues in caching the results of queries 
submitted to a Web search engine (WSE). We propose SDC (Static Dynamic Cache), a n< 
caching strategy aimed to efficiently exploit the temporal and spatial locality present in t 
stream of processed queries. SDC extracts from historical usage data the results of the r 
frequently submitted queries and stores them in a static, read-only portion of the cache, 
remaining entries of the c ... 

Keywords: Caching, Web search engines, multithreading 



15 Computing curricula 2001 

^ September 2001 Journal on Educational Resources in Computing (JERIC) 
Publisher: ACM Press 

Full text available: IS pdf( 61 3.63 ...... .. , , .. ., . . . 

KB)® html Additional Information: full citation , references , citings. inde> 

( 2778 KbT ~ 



16 Quer y term disambi g uation for Web cross-lan g ua g e information retrieval usin g a se 
<|>. engine 

Akira Maeda, Fatiha Sadat, Masatoshi Yoshikawa, Shunsuke Uemura 

November 2000 Proceedings of the fifth international workshop on on Informatior 

retrieval with Asian languages IRAL 'OO 
Publisher: ACM Press 

Full text available:^ pdf (736.31 ...... .. , . .. . A 4 . 

KB) Additional Information: full citation , abstract , references , citir 

With the worldwide growth of the Internet, research on Cross-Language Information Ret 
(CLIR) is being paid much attention. Existing CLIR approaches based on query translatio 
require parallel corpora or comparable corpora for the disambiguation of translated quer 
terms. However, those natural language resources are not readily available. In this pape 
propose a disambiguation method for dictionary-based query translation that is indepenc 
of the availability of such scarce langua ... 

Keywords: WWW, cross-language information retrieval, mutual information, search enc 



17 



Search en gine eng ineerin g : To randomize or not to randomize: space optimal 
summaries for hyperlink analysis 
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Tamas Sarlos, Adras A. Benczur, Karoly Csalogany, Daniel Fogaras, Balazs Racz 
May 2006 Proceedings of the 15th international conference on World Wide Web W 
'06 

Publisher: ACM Press 

Full text available: 1§ pdf(301 .26 Additional Information: full citation , abstract , references . ind< 
KB) terms 

Personalized PageRank expresses link-based page quality around user selected pages. Tl 
only previous personalized PageRank algorithm that can serve on-line queries for an 
unrestricted choice of pages on large graphs is our Monte Carlo algorithm [WAW 2004]. 
this paper we achieve unrestricted personalization by combining rounding and randomize 
sketching techniques in the dynamic programming algorithm of Jeh and Widom [WWW 2 
We evaluate the precision of approximation experimentally on lar ... 

Keywords: data streams, link-analysis, scalability, similarity search 



18 Aggreg ate predicate su p port in DBMS 

Apostol (Paul) Natsev, Gene Y. C. Fuh, Weidong Chen, Chi-Huang Chiu, Jeffrey S. Vitter 
January 2002 Australian Computer Science Communications , Proceedings of the i 
Australasian database conference - Volume 5 ADC '02, Volume 24 issu 
Publisher: Australian Computer Society, Inc., IEEE Computer Society Press 

Full text available: WpdfM .57 MB) Additional lnformation: M^^ . abstract, references , cjtJr 

In this paper we consider aggregate predicates and their support in database systems. 
Aggregate predicates are the predicate equivalent to aggregate functions in that they cai 
used to search for tuples that satisfy some aggregate property over a set of tuples (as 
opposed to simply computing an aggregate property over a set of tuples). The importanc 
aggregate predicates is exemplified by many modern applications that require ranked se 
or top-k queries. Such queries are the norm ... 

Keywords: aggregate predicates, nearest neighbor, query optimization 



19 OOPSLA onward! track: No name: j ust notes on software reuse 

^ Robert Biddle, Angela Martin, James Noble 

v December 2003 ACM sigplan Notices, volume 38 issue 12 

Publisher: ACM Press 

Full text available- B pdf(262 MB) Additional Information: full citation , abstract, references , fifth 

index terms 

In the beginning, so our myths and stories tell us, the programmer created the program 
the eternal nothingness of the void. In this essay, we recognise that programs these day 
like any other assemblage, and suggest that in fact programming has always been about 
reuse. We also explore the nature of reuse, and claim that Components themselves are r 
the most important consideration for reuse; it is the end product, the composition. The i: 
still involve value, investment, and return. ... 

Keywords: components, object-oriented programming, software reuse 
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20 Onward pa pers: No name: j ust notes on software reuse 
^ Robert Biddle, Angela Martin, James Noble 

v October 2003 Companion of the 18th annual ACM SIGPLAN conference on Object- 
oriented programming, systems, languages, and applications OOPSL 



In the beginning, so our myths and stories tell us, the programmer created the program 
the eternal nothingness of the void. In this essay, we recognise that programs these day 
like any other assemblage, and suggest that in fact programming has always been about 
reuse. We also explore the nature of reuse, and claim that Components themselves are r 
the most important consideration for reuse; it is the end product, the composition. The i: 
still involve value, investment, and return. ... 

Keywords: components, object-oriented programming, software reuse 
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Additional Information: full citation , abstract , references . ind< 
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1 Indexing and querying: Three-level cachin g for efficient quer y processin g in large V\ 
^ search en g ines 

Xiaohui Long, Torsten Suel 

May 2005 Proceedings of the 14th international conference on World Wide Web W 
■05 

Publisher: ACM Press 

Full text available: 1>) pdf (243.61 Additional Information: full citation , abstract , references , citii 
KB ) index terms 

Large web search engines have to answer thousands of queries per second with interact! 
response times. Due to the sizes of the data sets involved, often in the range of multiple 
terabytes, a single query may require the processing of hundreds of megabytes or more 
index data. To keep up with this immense workload, large search engines employ clustei 
hundreds or thousands of machines, and a number of techniques such as caching, index 
compression, and index and query pruning are used to im ... 



Keywords: Web search, caching, inverted index 



2 Information Retrieval: Predictive cachin g and prefetchin g of query results in search 
^ eng ines 

Ronny Lempel, Shlomo Moran 

May 2003 Proceedings of the 12th international conference on World Wide Web W 
'03 

Publisher: ACM Press 

Full text available: 1a1 pdf (212.73 Additional Information: full citation , abstract , references , citii 
KB ) index terms 

We study the caching of query result pages in Web search engines. Popular search engin 
receive millions of queries per day, and efficient policies for caching query results may ei 
them to lower their response time and reduce their hardware requirements. We present 
(probability driven cache), a novel scheme tailored for caching search results, that is ba< 
on a probabilistic model of search engine users. We then use a trace of over seven millio 
queries submitted to the search engine A ... 

Keywords: caching, query processing and optimization 
http://portal.acm.org/results.cfm?coll=ACM&dl-ACM&CFID=17694811&CFTOKEN=71604607 4/30/07 
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3 Boostin g the performance of Web search engines: Caching and prefetching query 
results by exploitin g historical usa g e data 
Tiziano Fagni, Raffaele Perego, Fabrizio Silvestri, Salvatore Orlando 
January 2006 ACM Transactions on Information Systems (TOIS), Volume 24 issue l 

Publisher: ACM Press 

Full text available: HP pdf(668.69 Additional Information: full citation , abstract , references , citii 
KB ) index terms 

This article discusses efficiency and effectiveness issues in caching the results of queries 
submitted to a Web search engine (WSE). We propose SDC (Static Dynamic Cache), a n« 
caching strategy aimed to efficiently exploit the temporal and spatial locality present in t 
stream of processed queries. SDC extracts from historical usage data the results of the r 
frequently submitted queries and stores them in a static, read-only portion of the cache, 
remaining entries of the c ... 

Keywords: Caching, Web search engines, multithreading 



4 Rank-preservin g two-level cachin g for scalable search engines 

^ Paricia Correia Saraiva, Edleno Silva de Moura, Novio Ziviani, Wagner Meira, Rodrigo Fonse 

^ Berthier Riberio-Neto 

September 2001 Proceedings of the 24th annual international ACM SIGIR conferei 
on Research and development in information retrieval SIGIR '01 

Publisher: ACM Press 

Full text available:^ pdf(232. 50 Additional Information: full citation , references , citin gs, inde) 
KB ) terms 



5 Quer ying and web: Efficient q uery processin g in geog raphic web search en g ines 
^ Yen-Yu Chen, Torsten Suel, Alexander Markowetz 

^ June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 
Management of data SIGMOD '06 
Publisher: ACM Press 

Full text available: &P_df(296J6 Addjtjona| | nformation: fu „ citation , abstract , references 

Geographic web search engines allow users to constrain and order search results in an 
intuitive manner by focusing a query on a particular geographic region. Geographic sean 
technology, also called local search, has recently received significant interest from major 
search engine companies. Academic research in this area has focused primarily on techn 
for extracting geographic knowledge from the web. In this paper, we study the problem 
efficient query processing in scalable geogr ... 
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6 O ptimizin g result prefetchin g in web search engines with se g mented indices 
^ Ronny Lempel, Shlomo Moran 

V February 2004 ACM Transactions on Internet Technology (TOIT), Volume 4 issue l 
Publisher: ACM Press 
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We study the process in which search engines with segmented indices serve queries. In 
particular, we investigate the number of result pages that search engines should prepare 
during the query processing phase. Search engine users have been observed to browse 
through very few pages of results for queries that they submit. This behavior of users 
suggests that prefetching many results upon processing an initial query is not efficient, s 
most of the prefetched results will not be requested by the ... 

Keywords: Distributed inverted indices, prefetching, search engines 



7 Inverted files for text search en g ines 
^ Justin Zobel, Alistair Moffat 

^ July 2006 ACM Computing Surveys (CSUR), Volume 38 Issue 2 
Publisher: ACM Press 

Full text available: *B pdf(944.29 Additional Information: full citation , abstract , references . ind< 
KB ) terms 

The technology underlying text search engines has advanced dramatically in the past de 
The development of a family of new index representations has led to a wide range of 
innovations in index storage, index construction, and query evaluation. While some of th 
developments have been consolidated in textbooks, many specific techniques are not wii 
known or the textbook descriptions are out of date. In this tutorial, we introduce the key 
techniques in the area, describing both a core impl ... 

Keywords: Inverted file indexing, Web search engine, document database, information 
retrieval, text retrieval 



8 Research session: DB and IR #1: An efficient and versatile query engine for Top X 
search 

Martin Theobald, Ralf Schenkel, Gerhard Weikum 

August 2005 Proceedings of the 31st international conference on Very large data I 

VLDB '05 
Publisher: VLDB Endowment 

Full text available: 18 pdf(442.21 Additional Information: full citation , abstract , references , citit 
KB) index terms 

This paper presents a novel engine, coined TopX, for efficient ranked retrieval of XML 
documents over semistructured but nonschematic data collections. The algorithm follows 
paradigm of threshold algorithms for top-k query processing with a focus on inexpensive 
sequential accesses to index lists and only a few judiciously scheduled random accesses, 
difficulties in applying the existing top-k algorithms to XML data lie in 1) the need to con 
scores for XML elements while aggreg ... 

9 Covera g e, relevance, and ranking: The impact of query operators on Web search e 
<^ results 

Caroline M. Eastman, Bernard J. Jansen 

October 2003 ACM Transactions on Information Systems (TOIS), Volume 21 Issue 4 
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Publisher: ACM Press 

Full text available: T§pdf( 373.50 Additional Information: full citation , abstract , references , citii 
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Research has reported that about lO&percnt; of Web searchers utilize advanced query 
operators, with the other 90&percnt; using extremely simple queries. It is often assume* 
the use of query operators, such as Boolean operators and phrase searching, improves t 
effectiveness of Web searching. We test this assumption by examining the effects of que 
operators on the performance of three major Web search engines. We selected one hunc 
queries from the transaction log of a Web search servic ... 

Keywords: Boolean operators, Relative precision, Web results, coverage, query operate 
ranking, search engines 



10 Browsers and Ul. web en g ineerin g, hy permedia & multimedia, security , and 
^ accessibility: Efficient query subscription processin g for prospective search en g ines 
Utku Irmak, Svilen Mihaylov, Torsten Suel, Samrat Ganguly, Rauf Izmailov 
May 2006 Proceedings of the 15th international conference on World Wide Web VI 
'06 

Publisher: ACM Press 

Full text available: pdf(88.44 Additional Information: full citation , abstract , references . ind< 
KB) terms 

Current web search engines are retrospective in that they limit users to searches againsl 
already existing pages. Prospective search engines, on the other hand, allow users to up 
queries that will be applied to newly discovered pages in the future. We study and comp. 
algorithms for efficiently matching large numbers of simple keyword queries against a st 
of newly discovered pages. 

Keywords: inverted index, prospective search, query processing 



11 Industrial session: Secure search in enterprise webs: tradeoffs in efficient 

im plementation for document level security 
Peter Bailey, David Hawking, Brett Matson 

November 2006 Proceedings of the 15th ACM international conference on Informa 

and knowledge management CIKM '06 
Publisher: ACM Press 

Full text available: 18 pdf( 538.46 Additional Information: full citation , abstract , references . ind< 
KB) terms 

Document level security (DLS) — enforcing permissions prevailing at the time of search • 
specified as a mandatory requirement in many enterprise search applications. Unfortuna 
depending upon implementation details and values of key parameters, DLS may come al 
high price in increased query processing time, leading to an unacceptably slow search 
experience. In this paper we present a model and a method for carrying out secure sean 
the presence of DLS within enterprise webs. We rep ... 

Keywords: access control, caching, collection level security, document level security, 
enterprise search, file systems, performance, scalability, security models 
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12 Best Paper: Early experiences with a 3D model search engine 
^ Patrick Min, John A. Halderman, Michael Kazhdan, Thomas A. Funkhouser 
V March 2003 Proceeding of the eighth international conference on 3D Web technoh 
Web3D '03 
Publisher: ACM Press 

Full text available:® pdf(192JVIB) Additional '"formation: full citation , abstract, references , cjtii 

index terms 

New acquisition and modeling tools make it easier to create 3D models, and affordable a 
powerful graphics hardware makes it easier to use them. As a result, the number of 3D 
models available on the web is increasing rapidly. However, it is still not as easy to find : 
models as it is to find, for example, text documents and images. What is needed is a \3C 
model search engine," a specialized search engine that targets 3D models. We created a 
prototype 3D model search engine to investigate the d ... 

Keywords: 3D model database, shape matching, shape query interfaces, specialized se 
engine 



13 Research pa pers: XML query, update, and search: Efficient keyword search for sm; 

^ LCAs in XML databases 

Yu Xu, Yannis Papakonstantinou 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 

Management of data SIGMOD '05 
Publisher: ACM Press 

Full text available: ^^(479,45 Additiona | information: full citation , abstract , references , cjtii 

Keyword search is a proven, user-friendly way to query HTML documents in the World W 
Web. We propose keyword search in XML documents, modeled as labeled trees, and des 
corresponding efficient algorithms. The proposed keyword search returns the set of smal 
trees containing all keywords, where a tree is designated as "smallest" if it contains no t 
that also contains all keywords. Our core contribution, the Indexed Lookup Eager algoritl 
exploits key properties of smallest trees in ... 
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14 Research sessions: Research 28: Search applications: Trustworthy keyword search 
reg ulatory-compliant records retention 

Soumyadeb Mitra, Windsor W. Hsu, Marianne Winslett 

September 2006 Proceedings of the 32nd international conference on Very large d 

bases - Volume 32 VLDB'2006 
Publisher: VLDB Endowment 

Full text available: 1§ pdf(879.26 Additional Information: full citation , abstract , references , ind 
KB). terms 

Recent litigation and intense regulatory focus on secure retention of electronic records h. 
spurred a rush to introduce Write-Once-Read-Many (WORM) storage devices for retainin 
business records such as electronic mail. However, simply storing records in WORM ston 
insuffcient to ensure that the records are trustworthy, i.e., able to provide irrefutable pn 
and accurate details of past events. Specifically, some form of index is needed for timely 
access to the records, but unless the in ... 
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15 A comprehensive project for CS2: combining key data structures and algorithms int 
&> inte g rated web browser and search en g ine 

Tia Newhall, Lisa Meeden 

February 2002 ACM SIGCSE Bulletin , Proceedings of the 33rd SIGCSE technical 

symposium on Computer science education SIGCSE '02, Volume 34 is 
Publisher: ACM Press 

Full text available: H-l pdf( 450.67 ...... .. , .. , .. . , . 4 

KB) Additional Information: full citation , abstract , references , citit 

We present our experience using a large, real-world application as a course project for tl 
second half of the semester of a CS2 course. Our primary goal for the project was to ere 
an engaging application that incorporated most of the key data structures and algorithm 
introduced in the course. Specifically, the project uses binary search trees, priority queu> 
hash tables, and graphs. The project consisted of four parts combined to build an integrc 
web browser and search engine in Java. ... 

16 Automated g atherin g of Web information: An in-depth examination of a g ents interac 
^ with search en g ines 

Bernard J. Jansen, Tracy Mullen, Amanda Spink, Jan Pedersen 

November 2006 ACM Transactions on Internet Technology (TOIT), Volume 6 Issue 4 
Publisher: ACM Press 

Full text available: HH pdf(386.13 Additional Information: full citation , abstract , references , ind 
KB ) terms 

The Web has become a worldwide repository of information which individuals, companie; 
organizations utilize to solve or address various information problems. Many of these We 
users utilize automated agents to gather this information for them. Some assume that tl 
approach represents a more sophisticated method of searching. However, there is little 
research investigating how Web agents search for online information. In this research, w 
first provide a classification for information agent ... 

Keywords: Search engines, Web searching, agent searching 



17 Indexin g and g uerying : Improvin g Web search efficiency via a locality based static 
^ prunin g method 

Edleno S. de Moura, Celia F. dos Santos, Daniel R. Fernandes, Altigran S. Silva, Pavel Calac 
Mario A. Nascimento 

May 2005 Proceedings of the 14th international conference on World Wide Web W 
•05 

Publisher: ACM Press 

Full text available: HP pdfd 75.01 Additional Information: full citation , abstract , references , citit 
KB). index terms 

The unarguably fast, and continuous, growth of the volume of indexed (and indexable) 
documents on the Web poses a great challenge for search engines. This is true regarding 
only search effectiveness but also time and space efficiency. In this paper we present an 
index pruning technique targeted for search engines that addresses the latter issue with< 
disconsidering the former. To this effect, we adopt a new pruning strategy capable of gr* 
reducing the size of search engine indices. Exp ... 
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18 A search en g ine for 3D models 

^ Thomas Funkhouser, Patrick Min, Michael Kazhdan, Joyce Chen, Alex Halderman, David Dol 
^ David Jacobs 

January 2003 ACM Transactions on Graphics (TOG), Volume 22 Issue l 
Publisher: ACM Press 



As the number of 3D models available on the Web grows, there is an increasing need for 
search engine to help people find them. Unfortunately, traditional text-based search 
techniques are not always effective for 3D data. In this article, we investigate new shape 
based search methods. The key challenges are to develop query methods- simple enough 
novice users and matching algorithms robust enough to work for arbitrary polygonal moi 
We present a Web-based search engine system that support ... 

Keywords: Search engine, shape matching, shape representation, shape retrieval 
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Mayank Bawa, Roberto J. Bayardo, Sridhar Rajagopalan, Eugene J. Shekita 

May 2003 Proceedings of the 12th international conference on World Wide Web V\i 



Publisher: ACM Press 
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Personal webservers have proven to be a popular means of sharing files and peer 
collaboration. Unfortunately, the transient availability and rapidly evolving content on su 
hosts render centralized, crawl-based search indices stale and incomplete. To address th 
problem, we propose YouSearch, a distributed search application for personal webserver 
operating within a shared context (e.g., a corporate intranet). With YouSearch, search n 
are always fast, fresh and complete — properties we ... 

Keywords: P2P, decentralized systems, information communities, intranet search, peer 
peer networks, web search 

20 Teachin g key topics in computer science and information systems throu g h a web sc 
eng ine project 

Michael Chau, Zan Huang, Hsinchun Chen 

September 2003 Journal on Educational Resources in Computing (JERIC), Volume 3 
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Advances in computer and Internet technologies have made it more and more important 
information technology professionals to acquire experience in a variety of aspects, indue 
new technologies, system integration, database administration, and project managemen 
provide students with a chance to acquire such skills, we designed a project called "Builc 
Search Engine in 90 Days," in which students were required to build a domain-specific V\i 
search engine in a semester. In this pa ... 

Keywords: education, indexing, web computing, web search engine, web spiders 
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